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Site-Specific Recombination in Eukarvotes 
and Constructs Useful Therefor 

FIELD OF THE INVENTION 

The present invention relates to methods for 
manipulating chromosomal sequences in cells by site- 
specific recombination promoted by recombinases . In a 
5 particular aspect, the present invention relates to methods 
for producing embryonic stem cells bearing nucleic acid 
sequences that have been rearranged by a site-specific 
recombinase expressed from a construct controlled by a 
tissue-specific promoter (e.g., a germline specific 
10 promoter). In another aspect, the present invention 
relates to methods for producing embryonic stem cells 
bearing nucleic acid sequences that have been rearranged by 
a site-specific recombinase expressed from a construct 
controlled by a conditional promoter. 

15 BACKGROUND OF THE INVENTION 

The analysis of gene function has increasingly come to 
require the production of subtle, tissue-specific, and 
conditional mutations in animals and plants. Although 
there are a number of methods for engineering subtle 

20 mutations in embryonic stem (ES) cells (Hasty et al . (1991) 
Nature 350:243-246, Askew et al . (1993) Mol Cell Biol 
13:4115-4124), the use . of site-specific recombinases to 
remove the selectable marker that permits isolation of 
homologously recombined ES cell clones has become 

25 increasingly prevalent (Kitamoto et al . (1996) Biochem 
Biophys Res Commun 222:742-747 , Fiering et al . (1993) Proc 
Natl Acad Sci USA 90:8469-8473, Schwenk et al . (1995) 
Nucleic Acids Res 23:5080-5081; Gu et al . (1993) Cell 
73:1155-1164; Sailer et al . (1996) Taniguchi Symposia on 



Brain Sciences, eds. Nakanishi et al . (Japan Scientific 
Press) , pp. 89-98) . 

Site -specific recombinases represent the best method 
for creating tissue- specif ic and conditional mutations in 
animals and plants, being employed first to remove the 
selectable marker to create a functionally wild-type 
allele, and then to inactivate the allele mosaically in 
animals and plants by removing some essential component in 
a tissue-specific or conditional manner (Gu et al . (1994) 
Science 265:103-106; Kuhn et al . (1995) Science 
269:1427-1429). Current protocols for using excissive 
site -specific recombination to remove selectable markers 
include transiently transfecting ES cell clones with a 
recombinase expression vector (Gu et al . (1993) cell 
73:1155-1164) , microinjecting fertilized oocytes containing 
the recombinant allele with a recombinase expression vector 
(Kitamoto et al . (1996) Biochem Biophys Res Commun 
222:112-111; Araki et al . (1995) Proc Natl Acad Sci USA 
92:160-164), or breeding animals and plants containing the 
recombinant allele to animals and plants, respectively, 
containing a recombinase transgene (Schwenk et al . (1995) 
Nucleic Acids Res 23:5080-5081; Lewandoski et al . (1997) 
Curr Biol 7:148-151) . Each of these approaches requires an 
investment of some combination of time, resources, and 
expertise over that required to generate animals and plants 
with homologously recombined alleles. The most commonly 
employed method, the secondary transfection of homologously 
recombined ES cell clones with a recombinase expression 
vector, additionally requires extended culture time that 
may decrease their potential to enter the germline. 

In principle, marker excision would be substantially 
simplified through the use of ES cells containing 
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recombinase nucleic acid constructs that were expressed in 
the germline, but not to an appreciable extent in the ES 
cells themselves or somatic tissues of animals and plants. 
The lack of ES cell expression would mean that targeting 
5 vectors containing selectable markers flanked by 
recombinase target sites could be used to isolate 
homologous recombinants without fear that the marker would 
be excised during culture . Robust recombinase expression 
in gametes would mean that the marker would be excised in 

10 at least some of the progeny of ES cell chimeras. Only a 
single step would be required to isolate subtle mutations 
and, if two different recombinase systems were employed, 
conditional and tissue-specific alleles could be produced 
with similar improvements in efficiency. A 

15 germline-specif ic recombinase nucleic acid construct could 
also be used to deliver recombined target nucleic acid 
constructs to the early embryo (Lewandoski et al . (1997) 
Curr Biol 7:148-151), so long as the recombined target was 
not detrimental to the terminal stages of spermatogenesis. 

20 Previous reports have shown that expression of nucleic 

acid constructs containing the proximal promoter of the 
mouse protamine 1 (mPl) locus is restricted to haploid 
spermatids in mature mice (Peschon et al . (1987) Proc Natl 
Acad Sci USA 84:5316-5319; Behringer et al . (1988) Proc 

25 Natl Acad Sci USA 85:2648-2652), although low levels of 
ectopic expression may occur in some mature tissues 
(Behringer et al . (1988) Proc Natl Acad Sci USA 
85:2648-2652). Inclusion of the mPl promoter does not 
guarantee expression in the male germline, however, for 

30 although nucleic acid constructs containing the mPl 
promoter and the SV4 0 T- antigen coding sequence were 
transcribed, the message was not translated at detectable 
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levels in spermatids (Behringer et al . (1988) Proc Natl 
Acad Sci USA 85:2648-2652) . 

Accordingly, there is a need in the art for methods to 
modulate expression of recombined target nucleic acid 
5 sequences in the early embryo. In addition, there is a 
need in the art for tissue-specific and conditional 
recombinatory tools to create transgenic animals and 
plants. These and other needs in the art are addressed by 
the present invention. 

10 BRIEF DESCRIPTION OF THE INVENTION 

The present invention meets the need in the art for 
modulating expression of recombined target nucleic acid 
sequences to the early embryo. The present invention 
further meets the need in the art for tissue-specific and 

15 conditional recombinatory tools to create transgenic 
animals and plants. Thus, in accordance with the present 
invention, it has been discovered that nucleic acid 
constructs encoding a germline specific promoter 
operatively associated with a recombinase coding sequence 

20 lead to efficient recombination of a target nucleic acid 
construct in the male germline, but not in other tissues. 
This suggests that such nucleic acid constructs could be 
used for the efficient production of embryos bearing 
conditional, genetically lethal alleles. It has 

25 additionally been discovered that ES cell lines generated 
from one of these transgenic lines could be used in 
combination with targeting vectors that contained 
loxP- flanked selectable markers to isolate homologous 
recombinants containing the marker and functional loxP 

30 sites. 
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RRTEF DESCRIPTION OF THK FIGURES 

Figure 1 illustrates a schematic of P2Bc and P2Br 
alleles. The positions of the PCR primers used (5'P and 
3'P) are indicated on the diagrams of the P2Bc and P2Br 
5 alleles. 

Figure 2 depicts the targeting of the hoxb-1 locus in 
ProCre ES cells using a targeting vector that contains a 
loxP- flanked selectable marker. Top, schematic of the 
wild-type hoxb-1 locus showing the positions of the two 

10 exons (open boxes), the position of a 5 ' Nrul site and 
flanking BamHI restriction endonuclease sites, and PCR 
primers (triangles) that amplify a 204 bp product from the 
wild-type allele that includes the Nrul site. Middle, the 
predicted organization of homologously recombined hoxb-1 

15 allele in which a neomycin cassette (NEO) , flanked by loxP 
sites (L) , has been inserted into the Nrul site shown in 
the top diagram. The insertion creates a novel BamHI site 
and the same PCR primers now amplify a 1600 bp product. 
Bottom: the predicted structure of the recombined allele 

20 shown in the middle panel after Cre-mediated excision of 
the neomycin cassette to leave a single loxP site in place 
of the Nrul site of the wild-type allele. Amplification 
with the same primers now yields a 268 bp product. 

DETAILED DESCRIPTION OF THE INVENTION 

25 in accordance with the present invention, there are 

provided nucleic acid constructs comprising a germline- 
specif ic promoter operatively associated with a recombinase 
coding sequence. 



6 

As used herein, the term "promoter" refers to a 
specific nucleotide sequence recognized by RNA polymerase, 
the enzyme that initiates RNA synthesis. The promoter 
sequence is the site at which transcription can be 
5 specifically initiated under proper conditions. The 
recombinase nucleic acid(s), operatively linked to the 
suitable promoter, is (are) introduced into the cells of a 
suitable host, wherein expression of the recombinase 
nucleic acid(s) is (are) controlled by the promoter. 

10 Germline-specif ic promoters contemplated for use in 

the practice of the present invention include the protamine 
1 gene promoter, the protamine 2 gene promoter, the 
spermatid-specif ic promoter from the c-kit gene (Albanesi 
et al. (1996) Development 122 (4) : 1291-1302) , the sperm- 

15 specific promoter from angiotensin-converting enzyme 
(Howard et al . (1993) Mol Cell Biol 13 (1) : 18-27; Zhou et 
al. (1995) Dev Genet 16 (2) : 201-209) , oocyte specific 
promoter from the ZP1 gene, oocyte specific promoter from 
the ZP2 gene, oocyte specific promoter from the ZP3 gene 

20 (Schickler et al . (1992) Mol Cell Biol 12 (1) : 120-127) , and 
the like. 

In addition to the above -described germline-specif ic 
promoters, tissue-specific promoters specific to plants are 
also contemplated for use in the practice of the present 

25 invention, including, for example, the LAT52 gene promoter 
from tomato, the LAT56 gene promoter from tomato, the LAT5 9 
gene promoter from tomato Eyal et al . (1995) Plant Cell 
7(3) :373-384) , the pollen-specific promoter of the Brassica 
S locus glycoprotein gene (Dzelzkalns et al . (1993) Plant 

30 Cell 5 (8) :855-863) , the pollen-specific promoter of the 
NTP303 gene (Weterings et al . (1995) Plant J 8(l):55-63), 
and the like. 
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Recombinases contemplated for use in the practice of 
the present invention include Cre recombinase, FLP 
recombinase, the R gene product of Zygosaccharomyces 
(Onouchi et al . (1995) Mol Gen Genet 247 (6) : 653-660) , and 
5 the like. 

Presently preferred constructs contemplated for use in 
the practice of the present invention include ProCre 
(comprising the protamine 1 gene promoter operatively 
associated with Cre recombinase) , ProFLP (comprising the 
10 protamine 1 gene promoter operatively associated with FLP 
recombinase) , ProR (comprising the protamine 1 gene 
promoter operatively associated with the R gene product of 
Zygosaccharomyces) , and the like. 

In accordance with another embodiment of the present 
15 invention, there are provided nucleic acid constructs 
comprising a conditional promoter or a tissue-specific 
promoter operatively associated with a recombinase coding 
sequence . 

Promoters contemplated for control of expression of 
2 0 recombinase nucleic acid(s) employed in accordance with 
this aspect of the present invention include inducible 
(e.g., minimal CMV promoter, minimal TK promoter, modified 
MMLV LTR) , constitutive (e.g., chicken 3-actin promoter, 
MMLV LTR (non-modified) , DHFR) , and/or tissue specific 
2 5 promoters. 

Conditional promoters contemplated for use in the 
practice of the present invention comprise transcription 
regulatory regions that function maximally to promote 
transcription of mRNA under inducing conditions. Examples 
30 of suitable inducible promoters include DNA sequences 
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corresponding to: the E. coli lac operator responsive to 
IPTG (see Nakamura et al . , Cell, 18:1109-1117, 1979); the 
metallothionein promoter metal -regulatory-elements 
responsive to heavy-metal (e.g., zinc) induction (see Evans 
5 et al., U.S. Patent No. 4,870,009), the phage T7lac 
promoter responsive to IPTG (see Studier et al . , Meth. 
Enzymol., 185: 60-89, 1990; and U.S. #4,952,496), the heat - 
shock promoter; the TK minimal promoter; the CMV minimal 
promoter; a synthetic promoter; and the like. 

10 Exemplary constitutive promoters contemplated for use 

in the practice of the present invention include the CMV 
promoter, the SV4 0 promoter, the DHFR promoter, the mouse 
mammary tumor virus (MMTV) steroid- inducible promoter, 
Moloney murine leukemia virus (MMLV) promoter, elongation 

15 factor la (EFla) promoter, albumin promoter, APO Al 
promoter, cyclic AMP dependent kinase II (CaMKII) promoter, 
keratin promoter, CD3 promoter, immunoglobulin light or 
heavy chain promoters, neurof iliment promoter, neuron 
specific enolase promoter, L7 promoter, CD2 promoter, 

20 myosin light chain kinase promoter, HOX gene promoter, 
thymidine kinase (TK) promoter, RNA Pol II promoter, MYOD 
promoter, MYF5 promoter, phophoglycerokinase (PGK) 
promoter, Stfl promoter, Low Density Lipoprotein (LDL) 
promoter, chicken (5-actin promoter (used in conjunction 

25 with ecdysone response element) and the like. 

As readily understood by those of skill in the art, 
the term "tissue specific" refers to the substantially 
exclusive initiation of transcription in the tissue from 
which a particular promoter, which drives expression of a 
30 given gene, is derived (e.g., expressed only in T-cells, 
endothelial cells, smooth muscle cells, and the like) . 
Exemplary tissue specific promoters contemplated for use in 



ch e practice of the present invention include the GH 
propter, the «SE propter, the GFAP promoter, 
neurotransmitter promoters (e.g., tyrosine hydroxylase, TH, 
choline acetyltransf erase. ChAT, and the like) , promoters 
5 for neurotropic factors (e.g., a nerve growth factor 
promoter, NT-3, BDNF promoters, and the like), and so on. 

In accordance with yet another embodiment of the 
present invention, there are provided embryonic stem cells 
containing a nucleic acid construct as described herein. 

L0 as readily understood by those of skill in the art, 

the above-described constructs can be introduced into a 
variety of animal species, such as, for example, mouse, 
rat rabbits, swine, ruminants (sheep, goats and cattle), 
humans, poultry, fish, and the like. Transgenic 
15 amphibians, insects, nematodes, and the like, are also 
contemplated. Members o£ the plant kingdom, such as, for 
example, transgenic mono- and dicotyledonous specks, 
including important crop plants, i.e., wheat, rice, maxze, 
soybean, potato, cotton, alfalfa, and the like, are also 
20 contemplated. 

For example, pluripotential ES cells can be derived 
from early pre- implantation embryos, preferably the ova are 
harvested between the eight-cell and blastocyst stages. ES 
25 cells are maintained in culture long enough to pernut 
integration of the promoter -recombinase nucleic add 
construct (s). The cells are then either injected xnto a 
host blastocyst, i.e., the blastocoel of the host 
blastocyst, or co-cultured with eight-cell to morula-stage 
30 ova, i.e., zona-free morula, so that transfected ES cells 
are preferentially incorporated into the inner cell mass of 
the developing embryo. With blastocyst injection, 
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transgenic offspring are termed "chimeric, H as some of 
their cells are derived from the host blastocyst and some 
transfected ES cells. The host embryos are transferred 
into intermediate hosts or surrogate females for continuous 
development . 

The transformation procedure for plants usually relies 
on the transfer of a transgene carrying a particular 
promoter construct via the soil bacterium Agrobacterium 
tumefaciens . Transformation vectors for this procedure are 
derived from the T-DNA of A. tumefaciens, and transgenes 
are stably incorporated into the nuclear genome. The 
activity of the transgenes can then be monitored in the 
regenerated plants under different conditions. In this 
way, many promoter elements that are involved in complex 
regulatory pathways such as light responsiveness or tissue 
specificity have been defined. 

Alternatively, direct (i.e., vectorless) gene 
transfer systems are also contemplated including chemical 
methods, electroporation, microinjection, biolistics, and 
the like. Protoplasts isolated from the plants can be 
obtained by treatment with cell wall degrading enzymes. 
DNA can be introduced into plant protoplasts by a number of 
physical techniques including electroporation and 
polyethylene glycol treatment in the presence of MgCl 2 . 
The method of choice for rapid promoter analyses in plants 
is the biolistic method. This technique involves the 
delivery of the particular DNA construct into plant cells 
by microprojectiles , i.e., nucleic acid(s) coated or 
precipitated by tungsten or gold. This method is not 
limited to any particular plant species or tissue type. 
Preferably, this method would allow quantitative analysis 



11 



10 



of transformation if appropriate selectable markers are 
included. 

In a preferred embodiment, the genome of embryonic 
stem cells according to the invention comprise a 
transcriptionally active selectable marker flanked by two 
recombination target sites. It is especially preferred 
that the recombinase encoded by the recombinase coding 
sequence operatively associated with a germline- specific 
promoter is selective for the recombination target sites 
flanking said selectable marker. 



Optionally, embryonic stem cells according to the 
invention may further comprise one or more of: 

a nucleic acid fragment flanked by two recombination 
target sites, wherein said recombination target sites are 
15 different than the recombination target sites which flank 
said selectable marker, 

a nucleic acid construct comprising a conditional 
promoter operatively associated with a recombinase coding 
sequence, 

20 a second nucleic acid construct comprising a tissue- 

specific promoter operatively associated with a second 
recombinase coding sequence, or the like. Preferably, the 
second recombinase coding sequence will be different than 
the first recombinase coding sequence. 

25 The ability to select and maintain nucleic acid 

constructs in the host cell is an important aspect of an 
expression system. The most common type of selectable 
marker incorporated in the nucleic acid construct is an 
antibiotic resistance element allowing selection with 

30 ampicillin, kanamycin, neomycin, tetracycline, hygromycin, 
puromycin, blastophycin, and the like. Other approaches 
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employ specially constructed host cells which require the 
selectable marker for survival. Such selectable markers 
include the valine tRNA synthetase, val S, the 
single-stranded DNA binding protein, ssb, thymidine kinase, 
or the like. Alternatively, naturally occurring partition 
systems that maintain copy number and select against 
plasmid loss is also contemplated. An example is the 
incorporation of the parB locus. Other selectable markers 
include HPRT and the like. 

Selectable markers specific for plants include, the 
gus A (uid A) , the bar gene, phosphinothricin and the like. 

In accordance with still another embodiment of the 
present invention, there are provided methods for excission 
of the transcriptionally active selectable marker from the 
above-described embryonic stem cells, said method 
comprising: 

passaging the genome derived from said embryonic stem 
cells through gametogenesis (i.e., spermatogenesis or 
oogenesis) . 

Excission of marker as contemplated herein can cause 
a variety of end results, e.g., deletion of the marker or 
a nucleic acid sequence, gain of function or loss of 
function, replacement of function, and the like, as well as 
modulation of any one or more of these results. 

5 Functions which are contemplated to be manipulated 

include regulating body size and growth rate, including 
recombining gene constructs which contain various growth 
hormone gene sequences. Other productivity traits that are 
targets include altering the properties or proportions of 

0 caseins, lactose, or butterfat in milk, increased 
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resistance to viral and bacterial diseases (i.e., 
"constitutive immunity" or germ- line transmission of 
specific, recombined antibody genes) , more efficient wool 
production, and the like. Other functions which are 
5 contemplated to be modulated include development of lines 
of transgenic animals and plants for use in directing 
expression of transgenes encoding biologically active human 
proteins . 

Agronomic traits which are contemplated to be 
10 modulated by use of the present invention include tolerance 
to biotic an abiotic stress, increased resistance to 
herbicides, pest damage, and viral, bacterial, and fungal 
diseases, improvement of crop quality (i.e., increase in 
nutritional value of food and feed) , reduction of post- 
15 harvest losses, improvement of suitability and enlargement 
of the spectrum for processing (i.e., altered quantity and 
composition of endogenous properties, production of new 
compounds of plant or non-plant origin such as biopolymers 
or pharmaceutical substances) . 

20 In accordance with a still further embodiment of the 

present invention, there are provided methods for the 
production of recombinant alleles, said method comprising: 
introducing a nucleic acid fragment flanked by at 
least two recombination target sites into embryonic stem 
25 cells as described herein, and 

passaging the genome derived from said embryonic stem 
cells through gametogenesis . 

As readily recognized by those of skill in the art, 
nucleic acid fragments can be introduced into ES cells by 
30 a variety of techniques, e.g., by homologous recombination, 
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-retroviral insertion, site specific- 
random insertion, retroviral 

mediated recombination, and the like. 

Nucleic acid fragments contemplated for use herein 
include fragments containing an essential portion of a gene 
of interest. 

in accordance with yet another embodiment of the 
present invention, there are provided methods for the 
pro duction of recombinant alleles, said method comprising: 

introducing at least one recombinase responsive construct 
into embryonic stem cells as described herein, 

wherein said construct (s) comprise (s) a nucleic 
acid fragment and a selectable marker, 

wherein said selectable marker is flanked by a 
first pair of recombination target sites, and 
3 wherein said nucleic acid fragment is flanked by 

a second pair of recombination target sites, 

passaging the genome derived from said embryonic stem cells 
through gametogenesis . 

in a presently preferred aspect, the first pair of 

w ^on faraet sites is recognized by a recombinase 
!0 recombination target sites. *=» 

which is expressed under the control of a germline-specif ic 
promoter and said second pair of recombination target sites 
is recognized by a recombinase which is expressed under the 
control of a conditional promoter or a tissue specific 
25 promoter. 

Optionally, the embryonic stem cells employed herein 
can further comprise a second nucleic acid construct 
selected from constructs comprising a conditional promoter 
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o.eratively associated with a recombinase coding sequence, 
operatives tissue-specific promoter 

a construct comprising a tissue sp 

operatives associated with a recombinase coding sequence, 
and the like. 

fl Hii another embodiment of the 
In accordance with still anotne* 

present invention, there are provided methods for the 
conditional assembly of functional gene(s) for expression 
in eukaryotic cells by recombination of individual inactive 
gene segments from one or more gene(s) of interest, 

wherein each of said segments contains at least one 
recombination target site, and 

wherein at least one of said segments contains at 
least two recombination target sites, 



said method comprising: 

introducing said individual inactive gene 
segments into an embryonic stem cell as descried 
herein, thereby providing a DNA which encodes . 
functional gene o£ interest, the expression product of 
which is biologically active, upon passage of the 
genome derived from said stem cells through 
gametogenesis. 

Por assembly of functional genes from inactive gene 
segments, see, for example, US Patent No. 5,654,182, 
incorporated herein by reference in its entirety. 

25 in accordance with a still further embodiment of the 

present invention, there are provided methods for the 
generation of recombinant livestock, said method 

comprising: 

combining embryonic stem cells that include nucleic 
30 acid construct according to the invention with host 
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pluripotential ES cells derived from early pre -implantation 
embryos , and 

introducing these combined embryos into a host female 
and allowing the derived embryos to come to term. 

5 

In accordance with yet another embodiment of the 
present invention, there are provided methods for the 
generation of recombinant plants, said method comprising 
transforming plant zygotes with nucleic acid constructs 
10 according to the invention and allowing the zygote to 
develop. 



The objective of the current work with ProCre nucleic 
acid constructs was to determine the potential of 
germline-specif ic promoters to implement efficient 

15 approaches utilizing site-specific recombinases to generate 
an array of sophisticated mutations in mammals and plants. 
The data shows that it is possible to create recombinase 
nucleic acid constructs that are expressed at high levels 
in the germ line but not to a functionally significant 

20 extent in either ES cells or embryonic or adult somatic 
tissues. Homologous recombinants with a selectable marker 
can be isolated in ES cells that contain 
promoter- recombinase nucleic acid constructs. Transgenic 
animals and plants bearing the promoter- recombinase nucleic 

25 acid constructs and a target allele transmit the recombined 
target to their progeny at high frequencies. These results 
establish the principle that mammals and plants containing 
loci that have been homologously recombined and then 
subsequently site-specifically recombined can be generated 

30 simply by using ES cells with a suitable recombinase 
nucleic acid constructs for the initial targeting. By this 
mechanism, alleles containing a single recombinase target 
site and a mutation of interest can be produced in the 
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progeny of ES cell chimeras without any investment of time, 
e*TerJ- or resources over that retired to create an 
anele that still contains a selectable marker. The 
P. X. has obvious utility in the production 
ana conditional mutations that retire generate of 
alleles with minimal structural alterations. Because the 
^sence and transcriptional activity of selectable marKers 
can contribute to phenotypes in an unanticipated and 

^ *l (1995) Genes Dev 
unwanted manner (Fierxng et al. (1995, 

9:2203-2213); Olson et al . .1996, 85 ^ 

approach will also useful for generating null alleles. 

Expression of the endogenous mPl locus (Hecht et al. 
,!,.<) E *P cell *es 1.4.1.3-190). and mPl-driven nucle.c 
acl d constructs (Behringer et al. (1988, Proc Natl Aa^S 
„ B A 85 = 2648-2652.- Braun et al. (1989, Nature 337:373-376 
Zambrowicz et al. (1993) Proc Natl Acad Sci VBA 
,0.5071-5075) is ■ restricted to haploid spermatrds. 
Kxpression of mP! nucleic acid construct expression 
typically begins at haploid stages, and both KKA (Caldwel 
Z Handel ,199!) Proc Natl Acad sci USA 
and proteins (Braun et al. (1989) Nature ,17,373-37.) 
diffuse through the spermatogenic syncytium. The result rs 
a highly efficient recombination of target alleles and the 
segregation of recombinase and target nucleic acid 
25 constructs in the first generation. 

cre-mediated recombination proved to be highly testis- 
specific in Procre mice. It is clear that the nucleic acrd 
constructs are not expressed in the inner cell mass or , 
other early embryonic tissues. Cells from pre-implantation 
30 embryos intermingle extensively and the embryo as a whole 
is derived from a small number of cells (Beadington et al 
,1989) Development 106:37-46,- Soriano and Jaenisch (1986) 
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Cell 46:19-29). If ProCre nucleic acid constructs 
recombined target sequences during pre -implantation stages, 
at least a few percent of the cells in many tissues would 
contain the P2Br allele and Southern and PCR analyses 
showed that this was not the case. The ectopic Cre 
activity seen in some ProCre strains probably resulted from 
low levels of recombinase expression in later embryos or 
mature tissues, a finding consistent with the expression 
patterns of other mPl-driven nucleic acid constructs. 
Northern analyses have failed to reveal the expression of 
mPl- containing nucleic acid constructs in a variety of 
mature tissues (Peschon et al . (1987) Proc Natl Acad Sci 
USA 84:5316-5319; Behringer et al . (1988) Proc Natl Acad 
Sci USA 85:2648-2652; Peschon et al . (1989) Ann N Y Acad 
Sci 564:186-197; Zambrowicz et al . (1993) Proc Natl Acad 
Sci USA 90:5071-5075), but nucleic acid constructs 
containing the mPl promoter and the SV4 0 T-antigen led to 
the consistent development of tumors pf the petrosal bone 
and right cardiac atrium (Behringer et al . (1988) Proc Natl 
Acad Sci USA 85:2648-2652) . 

PCR assays represent a very sensitive assay for 
whether sufficient levels of Cre protein were produced to 
effect recombination. Importantly, they measured the 
cumulative level of recombination, for events that occurred 
at any stage of development are likely to have been 
propagated to, and might be amplified in, descendant 
populations. The highest level of ectopic recombination 
was that observed in cardiac ventricular tissue of 
strain which generated a signal approximately equivalent to 
that expected if the ratio between recombined and 
unrecombined alleles were 1:104. The activities observed 
in other strains were considerably lower than this, and one 
strain did not show any ectopic activity. None of the 
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strains showed evidence of recombination in the cardiac 
atria and the petrosal bone was not examined. These assays 
did not rule out the possibility that higher levels of 
recombination occur in tissues that were not examined or 
that the low levels of recombination observed in some 
tissues reflected high levels of recombination in some 
component cell population. 

These low levels of ectopic activity suggest that 
m pl-driven recombinase nucleic acid constructs could be 
used for the production of embryos containing generally 
lethal alleles. Some alleles created by homologous 
recombination in ES cells will prove to be lethal in 
heteroses, as was the case for an mRNA editing mutation 
of the G1UR2 glutamate receptor subunit (Brusa et al . 
(1995) Science 270:1677-1680) . Germline transmission would 
be restricted to rare chimeras in which the level of 
chimerism was low enough in tissues affected by the 
mutation to allow survival and high enough in the germline 
to allow transmission. This problem could be circumvented 
0 by creating recombinase -conditional mutations in ES cells 
bearing mpl -recombinase nucleic acid constructs, or by 
making the same mutations in standard ES cells and then 
introducing the mpl -recombinase nucleic acid construct by 
breeding. So long as the recombined version of the allele 
5 did not adversely impact terminal stages of 
spermatogenesis, embryos containing the recombined allele 
could be efficiently produced. Embryos containing 
recombined nucleic acid constructs can also be produced 
through the activity of Cre nucleic acid constructs that 
30 are expressed during early embryogenesis from the human 
cytomegalovirus minimal promoter (Schwenk et al . (1995) 
Nucleic Acids Res 23:5080-5081), the adenovirus Ella 
promoter (Lakso et al. (1992) Proc Natl Acad Sci U S A 



T T W SSI M VTUU 



20 

89:6232-6236), or the zP3 promoter (Lewandoski et al . 
(1997) Curr Biol 7:148-151). ProCre and zP3 nucleic acid 
constructs have the advantage of delivering a recombined 
allele to the zygote, guaranteeing that all cells in the 
5 derived embryos will contain the allele. 

ProCre ES cells are but one of many different kinds of 
recombinase-bearing ES cells that could significantly 
shorten the time and effort required for a wide v variety of 
genetic manipulations in mice. The most obvious of these 

10 are complementary ProFLP ES cells in which the FLP 
recombinase was derived from S. cerevisae (Broach and Hicks 
(1980) Cell 21:501-508) or another species (Kuhn et al . 
(1995) Science 269:1427-1429). Conceptually distinct from 
these but perhaps as generically useful would be ES cells 

15 bearing inducible recombinase nucleic acid constructs that 
would facilitate temporal control of recombinase expression 
in ES cells, chimeras, and their progeny to generate 
site-specifically recombined alleles (Araki et al . (1992) 
J Mol Biol 225:25-37; No et al . (1996) Proc Natl Acad Sci 

20 USA 93:3346-3351; Logie and Stewart (1995) Proc Natl Acad 
Sci USA 92:5940-5944; Feil et al . (1996) Proc Natl Acad 
Sci USA 93:10887-10890) . Finally, fusion genes that led 
to recombinase expression in specific tissues could be used 
to address specific research objectives. 

25 The invention will now be described in greater detail 

by reference to the following non-limiting examples. 

Example 1 
Mammalian DNA Constructs 

A 652 bp fragment of the mPl promoter (SEQ ID N0:1; 
30 Peschon et al . (1989) Annals of the New York Academy of 



Sciences 186-197) was isolated by PCR using PCR primers 
(SEQ ID N0s:2 and 3) and genomic DNA templates from CCE ES 
cells (Robertson et al . (1986) Nature 323:445-448). This 
fragment was fused to a modified Cre coding sequence (SEQ 
ID NO: 4) which contains a consensus translation start site 
(Kozak (1986) Cell 44:283-292) , 11 codons for a human c-myc 
epitope (Evan et al . (1985) Mol Cell Biol 5:3610-3616), 
7 codons for a minimal SV4 0 nuclear localization signal 
(Kalderon et al . (1984) Cell 39:499-509) and the 
polyadenylation signal from pIC-Cre in the plasmid pOG304M 
(SEQ ID NO: 5) . The Cre expression plasmid pOG231 was 
prepared by fusing a modified Cre coding sequence from 
pIC-Cre (Gu et al . (1993) Cell 73:1155-1164), and 
containing the same translation start and nuclear 
localization signal, to the synthetic intron and CMV 
promoter of pOG44 (O'Gorman et al . (1991) Science 
251:1351-1355) . 

A plasmid, pOG277 (SEQ ID NO:7), containing a 
loxP- flanked neomycin cassette was prepared by inserting a 
wild-type loxP site (SEQ ID NO: 8; Hoess et al . (1982) Proc 
Natl Acad Sci USA 79:3398-402) into pBSKS (Stratagene) 
and then cloning the neomycin expression cassette from 
pMClneo-polyA (Thomas et al . (1987) Cell 51:503-512) 
between interactions of this loxP site. The hoxb-1 
targeting construct consisted of the PGK-TK cassette from 
pPNT (Tybulewicz et al . (1991) Cell 65:1153-63), and 1.4kb 
and 10.2kb of sequences 5' and 3' to an Nru I site 800 bp 
5' to the hoxb-1 transcriptional start site isolated from 
a 129 strain genomic library (Stratagene) . The 
loxP- flanked neo cassette from pOG277 was inserted into the 
Nrul site. The pOG277 neomycin cassette and a 3 -GAL 
sequence was inserted into the first exon of the large 
subunit of RNA polymerase II (RP2) (Ahearn et al . (1987) 
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j Biol. Chem. 262:10695-10705) to create the P2Bc allele 
(Figure 1) . Cre-mediated recombination of the P2Bc allele 
results in the deletion of the neomycin cassette (Neo) of 
P2Bc that is flanked by two loxP sites, leaving a single 
5 loxP site and fusing the B-Gal coding sequence to the 
initial codons of the RNA polymerase II coding sequence. 
Recombination increases the size of a Pst I fragment 
recognized by the RP2 probe, which is external to the 
targeting vector used, indicated by the shaded box below 
10 each allele. 



F.xample 2 
PrnHi ^t-ion of transg enic mice 

Fertilized oocytes obtained from matings of 129/SvJae 
(Simpson et al . (1997) Nat Genet 16:19-27) and BALB/c X 
15 C57BL/6 Fl mice were used for pronuclear injections of the 
Protamine-Cre fusion gene from pOG304M according to 
standard protocols (Hogan et al. Manipulating the Mouse 
Embryo: The Manual, Coldspring Harbor Press (1994), pg. 
497) . Production of ES cells and homologous recombinants: 
20 Heterozygous ProCre 129/SvJae males were mated to 
129/SvEms- + Ter VJ females (Simpson et al . (1997) Wat Genet 
16:19-27) to produce blastocysts that were cultured 
according to standard protocols (Robertson (1987) 
Teratocarcinomas and embryonic stem cells, a practical 
25 approach, eds . E. J. Robertson (IRL Press), pp. 71-112). 
The sex (King et al. (1994) Genomics 24:159-68) and ProCre 
status of each line were determined by PCR assays. 
Molecular analyses: Tail biopsy genomic DNA was used for 
hybridization assays or PCR assays to identify ProCre and 
30 P2Bc/r mice. PCR reactions for the detection of ectopic 
Cre activity used 100 ng of genomic DNA as a template to 
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i « P2Br-specific product using a 5' primer from the 

M „rl2 1 5 units of Taq polymerase, ana 

f 6 0°C Southern blots of reaction products 
temperature of 60 C. Soutn reaction 
were hybridized with a probe specif xc for the 



product . 



10 




A total of nine founder animals with ProCre nucleic 

•„ instructs were obtained from injections of 
acid constructs ed £rom 

1S Protamine-Cre fusion gene Two line 
injections of 129SvJae (Simpson et al. 

embryos, and seven fro. injections of CWF2 
16.19 » rt randomly selected 

embryos. The 129/SvJae lines and three ran y 
embryos d . ta il To determine whether 

hybrid lines were examined in detail^ e 
„ Procre nucleic acid constructs would ef f icien y 
a target allele, males were 
ProC re nucleic acid construct and a target 

_ ■ „ This "P2Bc n {Rol II, Jl ^ AiJ ' - 
rSCO t gure 1, was created using homologous 
*■ ! on In ES cells to insert a loxP-flanxed neomycin 
25 recombination in ES cells in , 0 the first exon of 

cassette and a P-GAL coding sequence into the f ir 

■F^r- t-he larqe subunit of RNA 
"Ur; ere- later recolination of the loxP 

fir was 6 expected to delete the intercalated sequences. 
30 creating -MBr- allele (£ol U. fi-Oal. E ecom>med> . 

These males were mated tc wild-type females and the 
resulting progeny were examined by Southern blotting 
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determine if they inherited the P2Bc or the P2Br allele, 
and to additionally determine the segregation pattern of 
ProCre nucleic acid constructs and P2Br alleles. Southern 
blot of Pst I digested tail biopsy DNA's from a +/P2Bc, 
+/ProCre male (sire) and four of his progeny by a wild- type 
female probed with n RP2 probe (top) and then reprobed with 
a Cre probe (bottom) . The large majority of transmitted 
target alleles were Cre-recombined P2Br alleles (Table 1) . 
ProCre nucleic acid constructs and recombined target 
alleles segregated independently in the first generation; 
approximately 50% of mice that inherited a P2Br allele also 
inherited their male parent's ProCre nucleic acid 
construct. All RP2 mutant alleles in the progeny were 
P2Br, and some progeny inherit a P2Br allele without 
inheriting ProCre nucleic acid construct. Mouse 4 did not 
contain a ProCre nucleic acid construct and is homozygous 
wild- type at the RP2 locus. These data establish that 
ProCre nucleic acid constructs efficiently recombine the 
P2Bc allele in the male germline and that the recombined 
P2Br alleles and ProCre nucleic acid constructs segregate 
in the first generation. Because significantly more than 
25% of the progeny inherited recombined target alleles, 
recombination either occurred during diploid stages of 
spermatogenesis or Cre generated during haploid stages of 
spermatogenesis was distributed among spermatids through 
cytoplasmic bridges (Braun et al . (1989) Nature 
337:373-376), effecting recombination in spermatids that 
did not themselves contain a ProCre nucleic acid construct. 

The progeny of matings between ProCre males and +/P2Bc 
females were also examined to determine if male gametes 
from ProCre mice delivered enough Cre to zygotes to effect 
Cre-mediated recombination of a target sequence. Of 96 
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progeny examined by Southern blotting, none contained a 
Cre-recombined P2Br allele. 

It has also been discovered that a loxP- flanked neo 
cassette in the glutamate receptor R6 subunit locus is 
5 efficiently recombined by ProCre nucleic acid constructs in 
mice . 



Example 4 

ProCre Nucleic acid construct E xpression is Highly 

Ti asne-Specif ic 

10 Genomic DNAs from ten different tissues of five- to 

seven-week old males that contained both a ProCre nucleic 
acid construct and a P2Bc target allele were analyzed in 
Southern blots. Southern blots were prepared of Pst I 
digested DNA from testes (T) and one other tissue (K, 
15 kidney; B, brain; S, spleen) of males heterozygous for one 
of four ProCre nucleic acid constructs and the P2Bc allele. 
Testis DNA from each male shows a P2Br allele signal, in 
addition to those generated by the wild-type RP2 (WT) and 
P2Bc alleles. Other tissues show only the WT and P2Bc 
20 signals. Only the testis samples showed signal indicating 
Cre-mediated recombination of the target. The intensity of 
the P2Br signal relative to that of the wild-type allele 
ranged from 10% to 22% for different ProCre strains and did 
not correlate with the ProCre nucleic acid construct copy 
25 number. The copy number of ProCre nucleic acid constructs 
varied among lines showing similar levels of recombination 
in testis. For example, restriction patterns and 
densitometric analyses showed that line 58 contained a 
single copy of the ProCre nucleic acid construct, yet 
30 showed virtually the same testis recombination signal as 
line containing more than 100 copies. This variability is 



26 



similar to results obtained with other mPl promoter -driven 
nucleic acid constructs (Peschon et al . (1987) Proc Natl 
Acad Sci USA 84:5316-5319; Zambrowicz et al . (1993) Proc 
Natl Acad Sci USA 90:5071-5075). 

5 As a more sensitive measure of ectopic recombination, 

PCR amplifications were performed on the same samples. The 
amplification primers were expected to produce a 325 bp 
product from the recombined target and a 1.4 kb fragment 
from the unrecombined allele (Figure 1) . The assay was 
10 expected to measure the cumulative level of recombination, 
for any P2Br alleles formed during transient expression of 
Cre during development would be preserved and perhaps 
amplified in descendant cells. Low levels of ectopic 
recombination product were observed in some tissues of all 
15 ProCre lines except for one. A southern blot of PCR 
amplification products of the P2Br allele utilized tissues 
from a male heterozygous for the ProCre nucleic acid 
construct and the P2Bc allele. DNA from 10 different 
tissues was amplified using primers and conditions that 
20 produced a 350 bp product from the recombined, P2Br allele. 
Each lane contains 10% of the reactions, except for the 
testis reactions, which were diluted 500 (T5) , 250 (T2) , 
and 100 (Tl) fold prior to loading, and a liver 
reconstruction control (C) , which was diluted 1:100 before 
25 loading. The highest level of ectopic activity was 
observed in cardiac ventricular muscle of mice; in these 
samples the ectopic signal was more than 100 fold lower 
than that observed in testis. Three strains showed much 
lower levels of recombination in brain tissue, and 
30 strain 75 additionally showed ectopic activity in spleen. 
Despite the difficulty of quantifying PCR results, these 
data clearly indicate that ectopic activity occurred at 
very low levels in most tissues of most ProCre lines. 
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~o n lines were established from 
F our male + /ProCre ES cell Imes w« „ 

129/S v strain ProCre transgenic mrce^ J * 

*-«. nasS aae 5 cells from one of these xm 
experiments, passage ^ 5Q 

wer e used to generate three mal 

and 95% coat color chimerism In *ati g ^ 
, un of these male chimeras have sirea a 

fOTaleS ' all bearing the Agouti coat color signify^ 
H pups, all bearing ^ & q£ , 

germline transmiss^n ot the ce g 

pups genotyped additionally conta.n d^U 

n ucleic acid — " ■ been T r ete „ i : ed , no r has it been 
transmission has not yet bee trans mission 
de termined whether competency <« J^^. £S c . u . at 
will persist in homologously recombmed ProCre 

later passages. 

To determine if homologously -combine* ProCre BS cell 

t-araetinq vectors tnac 
-i j icnlated using cargeuxny 

clones could be isolated y marker , two 

, . mA a loxP-flanked selectable marker, 
contained a j-^- tarqeting 
tran sfections were done using giants of 9 

in which a loxP-flanked neomycin casset 
vector m which ^ promoter 

inserted into an Nru I site i d genomic DNAs 

^ A Southern blot of BamHI -digested g 
^rt sted fro. a 9 S-well plate fro. 10 doubly-selected 
- with a probe ^ ^ 

result from homologous recomb.nat.on 
transfections, 12 of 62 (»*) PC3" «- » 
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PCs-derived clones that were ganciclovir and G418 -resistant 
(Mansour et al . (1988) Nature 336:348-352) were found to be 
homologous ly recombined. In two parallel transf ections of 
CCE cells (Robertson et al . (1986) Wature 323:) with the 
same vectors, 32 of 93 (34%) and 15 of 132 (11%) clones 
were homologously recombined. The total numbers of 
G418-resistant clones recovered from ProCre ES cell 
transfections were reduced relative to the parallel CCE 
transf ections . This may be attributable to both 
Cre -mediated excision of the neomycin cassette and to the 
fact that the transfections were done under electroporation 
conditions optimized for CCE cells. 

Because it was formally possible that the homologously 
recombined clones contained inactive loxP sites, five 
homologously recombined PC3 ES cell clones and the parental 
PC3 cell line using the primers shown in Figure 2 were 
either mock transfected or transiently transfected with the 
pOG231 Cre expression vector. For the transient 

transfection assay, DNA was harvested 48 hours after 
transfection and used in PCR assays to assess whether the 
loxP sites in the recombinant clones could be recombined by 
Cre. In all cases a clear recombination signal was 
observed in the pOG231 transfected sample. The recombinant 
clones and parental cell lines show the 204 bp 
amplification product of the wild- type allele, and the 
recombinant clones additionally show a 1600 bp product 
(1600) resulting from amplification across the neomycin 
cassette and a nonspecific 1100 bp amplification product 
(NS) . The pOG231-transfected recombinant clones show an 
additional 268 bp product signaling the Cre-mediated 
excision of the neomycin cassette from the recombinant 
alleles of some cells. Experiments were also done to 
assess the stability of the loxP- flanked neo cassette in 
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ProCre ES cells. Five recombinant clones were grown m the 
presence of G418 for two weeks, and then aliquots of each 
were grown either in the presence or absence of G418 for a 
further 10 days. PGR assays were performed to determine xf 

5 cre-recombined alleles were present in any of these samples 
and none was observed in the mock transfected controls. 
These data suggest that there is not enough Cre activity to 
significantly influence either the ability to isolate 
recombinant clones or the stability of the selectable 

10 markers in those clones, establishing that the loxP sites 
in these clones were functional. 

To determine if there was any detectable Cre activity 
in ProCre ES cells, aliquots of two lines (PC3 and PC5) 
wer e transiently transfected with the targeting vector used 
15 to create the P2Bc allele. DNA was recovered 48 hours 
after transfection and used for PGR amplifications of the 
P2Br plasmid molecules that would be generated by 
extrachromosomal Cre-mediated recombination. Small amounts 
of recombination product were seen in both ProCre ES cell 
20 transfections, and none was observed in parallel samples of 
CCE ES cells. This shows that the ProCre ES cell lines 
express sufficient Cre to recombine some extrachromosomal 
targets when the latter are present at high copy numbers. 
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Example 6 
Plant DNft rr»p«t:ructB 
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To define sequences in the LKT52 and IATS9 promoters 
involved in expression in pollen, proximal promoters were 

„ , •„„ » series of linker substitution 
constructed employing a series or 

mutants using the particle bombardment system (Klein et al^ 
tl 987, Nature 3 2,,70-73; Twell et al. (1989b, Plant Physiol 
91-1270-1274). These experiments were performed by co- 
bombarding the test plasmids (luciferase [M C, - recombinase 
fusions, with reference plasmids (^glucuronidase tGUS] 
fusions) . The latter served as a control for bombardment 
variability and allowed comparisons to be made between 
independent bombardments . 

The context of the -100 promoter in LAT52 and the -115 
promoter in LAT59 was chosen because these promoters 
appeared to be the minimal regions that still con erred 
high levels (25% relative to the available full-length 
propter) of pollen-specific expression (Twell et al. 
t»91) Gen Dev 5 = 496-507). These minimal promoters were 
, then fused to the Cre coding sequence operatively linked to 
the luc gene (Ow et al. (1986) Science 224:856-958) coding 
region, and the resulting plasmids served as a basis for 
creating the nucleic acid constructs. The 1*T52 linker 
substitutions were performed in pHWC. which contain 
5 entire LAT52 S- untranslated region »• UTR) . A series of 
six 9- to 10-bp-long linker substitutions were made in 
P52LUC, spanning the region -84 to -29 (52LS1 to 52LS6) . 
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Example 7 
Tissue Specificity i n Plants 

The results obtained by transient expression in pollen 
and in transgenic plants provided information on the effect 
5 of the various constructs on expression in pollen but not 
on their effect on tissue specificity. A tobacco cell 
culture, TXD {maintained as described by Howard et al . 
(1992) Cell 68:109-118), was, therefore, added as an 
additional component of the transient assay system. The 
10 TXD cell culture was initiated from tobacco mesophyll cells 
and therefore represents somatic tissue, as opposed to the 
gametophytic tissue represented by pollen. Cells in 
culture were chosen, rather than intact tissue, as the 
somatic tissue source because such cells superficially 
15 resemble pollen in that they can be spread out as a 
monolayer on a plate before bombardment. 

In this experiment, translation fusions between the 
luc coding region and either the CaMV 35S promoter drove 
strong expression in cell culture but negligible expression 

20 in pollen, whereas the LAT52 promoter showed the opposite 
pattern of strong activity in pollen and negligible 
activity in cell culture. Thus, the transient assay system 
mimics the expression pattern observed for these promoters 
in transgenic plants (Twell et al . (1991) Genes Dev 5:496- 

25 507) . This differential expression provided us with a tool 
with which to address tissue specificity. 

Example 8 

Plant Transformation and Analysis of Transgenic Plants 



Constructs cloned into pBinl9 were introduced into 
30 tomato (Ly coper a icon esculentum cv VF36) by Agrobacterium 
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r™riouslv described (McCormick 
tumefaciens LBA4404 as P'™^ Agr obacterium 
11991b) Transformation of tomato witn 3 

aliens m Plant Tissue Culture Manual, K. Lrnsey. Ed 
:r; 9 ; M least 20 independent transformers -re 
obtained for each construct. 

Por ^-glucuronidase (GUS, assays. 5 to 20 ,L of 

i f1nwprs of the same plant, was 
pollen, pooled fro. seve ral ^ f lo-- o ^ ^ 

g round directly rn Eppendorf tubes r ^ 

extraction buffer (Jefferson et al. 

f using a Teflon-tipped homo g enizer driven by a toll. 

3907) using , b f luorometrically 

■in tm-iI len was measured j-- 1 -" 

EXPreSS1 °; US acfit V in supernatant of pollen extracts 
assayxng GUS activity P de {sigma) as 

using 2mM 4-methylumbellif eryl o u g 

9 - al (iQ87) EMBO 6:3901-3907). GUS 

substrate (Jefferson et al.(l987) EMB 

corrected for variation in total protein 
activity was correcteu i-<-> 

content using a bicinchoninic acid prote.n assay Krt 

(Pierce, Rockford, Ik) . 

Expression in leaves, flowers, stems, roots, and seed 
uas tested historically by staining with 5-bromo-4- 
. Tloro 3-indolylS-o-^uronide ^-cular Probes Ku 9 ene 

« as described previously «-~ * ^ 0 " alalyzed 
6:3901-3907). Expression in leaves 
f luorometrically as given previously. 

T?.vample 9 

25 jj^Uui) Tinn— ^ of Tobacco^lleE 

lv x culture 

Pollen spread out as a monolayer was bombarded 
essentially as previously described (Twell et ^ al .J» 1> 
Genes Pev S:49S-50 7 >, except that 9 old was 
30 tungsten and only 1 M9 of test plasmid and used per plate. 
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TXD cell culture (maintained as described by Howard et al . 
(1992) Cell 68:109-118) was spread out similarly as a 
monolayer (1 mL of a 50-mL stationary culture per plate) 
and bombarded as previously described. Between six and 12 
5 independent bombardments were performed for each construct . 
in each experiment, the test plasmid was co-bombarded with 
a reference plasmid: pB1223 (Clontech, Palo Alto, CA) was 
used for assays of all constructs in tobacco cell culture; 
pLAT59-12 (Twell et al . (1990) Development 109:705-713) for 
10 assays of LAT52 and LAT56 constructs in tobacco pollen; 
PLAT56-12 (Twell et al . (1990) Development 109:705-713) for 
assays of LAT59 constructs in tobacco pollen. Processing 
of the tissue after ~ 15 to 17 hr and analysis of GUS and 
LUC activity were as described previously (Twell et al . 
15 (1991) Genes Dev 5:496-507). Transient expression was 
reported as "relative LUC activity," which represents the 
ratio between the test (LUC) and the reference (GUS) 
plasmids . 

While the invention has been described in detail with 
20 reference to certain preferred embodiments thereof, it will 
be understood that modifications and variations are within 
the spirit and scope of that which is described and 
claimed. 
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That which is claimed is: 

1. A nucleic acid construct comprising a germline- 
specif ic promoter operatively associated with a recombinase 
coding sequence. 

2. A nucleic acid construct according to claim 1 
wherein said germline -specific promoter is the protamine 1 
gene promoter, the protamine 2 gene promoter, the 
spermatid-specific promoter from the c-kit gene, the sperm- 
specific promoter from angiotensin-converting enzyme, 
oocyte specific promoter from the ZP1 gene, oocyte specific 
promoter from the ZP2 gene, or oocyte specific promoter 
from the ZP3 gene. 

3. A nucleic acid construct according to claim 1 
wherein said germline-specif ic promoter is the LAT52 gene 
promoter from tomato, the LAT56 gene promoter from tomato, 
the LAT59 gene promoter from tomato, the pollen-specific 
promoter of the Brassica S locus glycoprotein gene, or the 
pollen- specific promoter of the NTP303 gene. 

4. A nucleic acid construct according to claim 1 
wherein said recombinase coding sequence encodes Cre 
recombinase . 

5. A nucleic acid construct according to claim 4 
wherein said construct is ProCre, comprising the protamine 
1 gene promoter operatively associated with Cre 
recombinase . 
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6 . a nucleic acid construct according to-d^ 
„ h erein said kinase — seance enco.es 
recombinase . 

-a construct according to claim 6 
7 a nucleic acid construct 

, „h is ProFLP, comprising the protamine 
wherein said construct i. ProFLP 
, gene promoter operatively associated 

recombinase . 

, A nucl eic acid construe, according to claim 1 
herein said reco^inase coding seance encodes the R gene 
product of Zygosaccharomyces. 

9 a nucleic acid construct according to claim . 
herein said construct is Pro*, prising the proton* 
gene propter operatives associated wrth . the ge 
product of zygosaccharomyces. 

10 A nucleic acid construct comprising a conditional 

.^ oA with a recombinase coding 
promoter operatively associated with 

sequence . 

U A nucleic acid construct comprising a tissue- 
spe cific promoter operatively associated with a recombinase 
coding sequence. 

12 . Embryonic stem cells containing a nucleic acid 
construct according to claim l. 

„ Embryonic stem cells according to claim 12 
wher ein the genome thereof comprises a transcriptionally 
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active selectable marker flanked by two recombination 
target sites. 

14. Embryonic stem cells according to claim 13 
wherein the recombinase encoded by the recombinase coding 
sequence operatively associated with a germline- specif ic 
promoter is selective for the recombination target sites 
flanking said selectable marker. 

15. Embryonic stem cells according to claim 13 
further comprising one or more of: 

a nucleic acid fragment flanked by two recombination 
target sites, wherein said recombination target sites are 
different than the recombination target sites which flank 

said selectable marker, 

a nucleic acid construct comprising a conditional 
promoter operatively associated with a recombinase coding 
sequence, or 

a nucleic acid construct comprising a tissue-specific 
promoter operatively associated with a recombinase coding 
sequence - 

16. Embryonic stem cells containing a nucleic acid 
construct according to claim 2. 

17. Embryonic stem cells containing a nucleic acid 
construct according to claim 3. 

18. Embryonic stem cells containing a nucleic acid 
construct according to claim 4. 
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19. Embryonic stem cells containing a nucleic acid 
construct according to claim 5 . 

20. Embryonic stem cells containing a nucleic acid 
construct according to claim 6 . 

21. Embryonic stem cells containing a nucleic acid 
construct according to claim 7 . 

22. Embryonic stem cells containing a nucleic acid 
construct according to claim 8 . 

23. Embryonic stem cells containing a nucleic acid 
construct according to claim 9 . 

24. Embryonic stem cells containing a nucleic acid 
construct according to claim 10. 

25. Embryonic stem cells according to claim 24 
wherein the genome thereof comprises a transcriptionally 
active selectable marker flanked by two recombination 
target sites. 

26. Embryonic stem cells containing a nucleic acid 
construct according to claim 11 . 

27. Embryonic stem cells according to claim 26 
wherein the genome thereof comprises a transcriptionally 
active selectable marker flanked by two recombination 
target sites. 
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28. A method for excission of the transcriptionally 
active selectable marker from the embryonic stem cells of 
claim 13, said method comprising: 

passaging the genome derived from said embryonic stem 
cells through gametogenesis . 

29. A method according to claim 28 wherein said 
genome is passaged through spermatogenesis. 

30. A method according to claim 28 wherein said 
genome is passaged through oogenesis. 

31. A method according to claim 28 wherein said 
embryonic stem cells further comprise one or more of : 

a nucleic acid fragment flanked by two recombination 
target sites, wherein said recombination target sites are 
different than the recombination target sites which flank 

said selectable marker, 

a nucleic acid construct comprising a conditional 
promoter operatively associated with a recombinase coding 
sequence , or 

a nucleic acid construct comprising a tissue-specific 
promoter operatively associated with a recombinase coding 
sequence . 

32. A method for the production of recombinant 
alleles, said method comprising: 

introducing a nucleic acid fragment flanked by at 
least two recombination target sites into embryonic stem 
cells according to claim 10, and 
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passaging the genome derived from said embryonic stem 
cells through gametogenesis. 

33. A method according to claim 32 wherein said 
nucleic acid fragment comprises an essential portion of a 
gene of interest . 

34. A method according to claim 32 wherein said 
nucleic acid fragment is introduced by homologous 
recombination, random insertion, retroviral insertion, or 
site specific-mediated recombination. 

35. A method for the production of recombinant 
alleles, said method comprising: 

introducing a nucleic acid fragment flanked by at 
least two recombination target sites into embryonic stem 
cells according to claim 13, and 

passaging the genome derived from said embryonic stem 
cells through gametogenesis. 

36. a method according to claim 35 wherein said 
embryonic stem cells further comprise a second nucleic acid 
construct selected from the group consisting of a construct 
comprising a conditional promoter operatively associated 
with a recombinase coding sequence and a construct 
comprising a tissue-specific promoter operatively 
associated with a recombinase coding sequence. 

37. A method according to claim 36 wherein the 
recombinase encoded by said second construct is expressed 
in response to inducing conditions. 
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39 A method according to claim 36 wherein the 
recombinase encoded by said second construct is expressed 
in a tissue selective manner. 

39 a method according to claim 35 wherein the 
recombination target sites flanking said nucleic acid 
£ ragment are recognized by a recomhinase which is expressed 
under the control of a conditional promoter or a t.ssue 
specific promoter. 

t0 . A method for the production of recombinant 
alleles, said method comprising-. 

introducing at least one recomhinase responsive construct 
into embryonic stem cells according to claim 10, 

wherein said construct (.) comprise <s) a nuclerc 
acid fragment and a selectable marker, 

wherein said selectable marker is flanked by a 
first pair of recombination target sites, and 

wherein said nucleic acid fragment is flanked by 
a second pair of recombination target sites, 

passaging the genome derived from said embryonic stem cells 
through gametogenesis . 

41 A method according to claim 40 wherein said first 
pair of recombination target sites is recognized by a 
recombinase which is expressed under the control of a 
germline-specific promoter and said second pair of 
recombination target sites is recognized by a recomb.nase 
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„„r,»-voi of a conditional 
which is express^ under the control 

promoter or a tissue specific promoter. 

42 a method according to claim 40 wherein said 
embryonic stem cells further comprise a second nucleic acid 
construct selected from the group consisting of a construct 
comprising a condition*! promoter operative^ associate 
„ ith a recombinase coding seance and a constru t 

■ ■ , tissue-specific promoter operatively 

comprising a tissue oh 

associated with a recombinase coding sequence. 

43 a method for the conditional assembly of 
functional gene(s) for expression in eu.aryotic cells by 
recombination of individual inactive gene segments from one 
or more gene(s) of interest, 

wherein each of said segments contains at least one 

recombination target site, and 

of said segments contains at 
wherein at least one ot saia * y 

least two. recombination target sites, 

said method comprising: 

introducing said individual inactive gene 
segments into an embryonic stem cell according to 
claim 10, thereby providing a DNA which encodes a 
functional gene of interest, the expression product of 
which is biologically active, upon passage of the 
gen ome derived from said stem cells through 
gametogenesis. 

44 . A method for the generation of recombinant 
livestock, said method comprising: 
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combining embryonic stem calls that include a nuclerc 
ac id construct according to claim 1 with host 
pluripotential ES cells derived from early preimplantatron 

embryos , and 

introducing these combined embryos into a host female 

and 

allowing the derived embryos to come to term. 
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A method for the generation of recombinant 
plants" said method comprising transforming plant zygotes 
with nucleic acid constructs according to claim 1 and 
allowing the zygote to develop. 
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SEQUENCE LISTING 



<110> O'Gorman, Steve 
Wahl, Geoffrey 



<120> Site-Specific Germline Recombination in 
lukaryotes and Constructs Useful Therefor 



<130> Salk2l90 

<150> 08/919,501 
<151> 1997-08-28 

<160> 8 

<170> FastSEQ for Windows Version 3.0 

<210> 1 

<211> 652 

<212> DNA 

<213> Mus musculus 



^cta^r/tccaacacc tccctcagtc = g= t«a tgtggctccc « 

Itttatacct gaagcacttg -tggggcctc -t^tac ^ „ 0 

ctctgagacc ctctggattt gtctgtcagt 9«"actgg 99 9 99 tttcaca gtt 240 

aaggtcaagt tccctcagca 9cattctctg 3«SSt!t caggacctag 300 
acaaatccat gtggctgttt cacccacctg «*ggccttg JJt acactca agt 
cctagaagca ggtgtgtggc acttaacacc ™f a ^ gat | c caaagccctg 
ggatgccatc tttgtcactt 9^9 9 g g tacagg tc ctcactggcc atggtctgtg 

ssss se ~ : « = ssss 

*4= SSSS SS= «™ « 



360 
420 
480 
540 
600 
652 



<210> 2 
<211> 29 
<212> DNA 

<213> Artificial Sequence 

<400> 2 29 
gtctagtaat gtccaacacc tccctcagt 

<210> 3 
<211> 31 
<212> DNA 

<213> Artificial Sequence 

<400> 3 31 
ctctgagcca gctcccggcc aagccagcac c 

<210> 4 
<211> 1022 
<212> DNA 

<213> Artificial Sequence 
. t33 . g l- «U««c cc,, 5 . ? .. ^Jt, » 

SEES 3=S= =SS SSS. «~ »• 



1 



tggaaaatgc 
aatggtttcc 
gtctggcagt 
ccgggctgcc 
aagaaaacgt 
tcgaccaggt 
catttctggg 
ttaaagatat 
cgctggttag 
agcgatggat 
tcagaaaaaa 
aagggatttt 
gatacctggc 
ctggagtttc 

tg 



ttctgtccgt 
cgcagaacct 
aaaaactatc 
acgaccaagt 
tgatgccggt 
tcgttcactc 
gattgcttat 
ctcacgtact 
caccgcaggt 
ttccgtctct 
tggtgttgcc 
tgaagcaact 
ctggtctgga 
aataccggag 



ttgccggtcg 
gaagatgttc 
cagcaacatt 
gacagcaatg 
gaacgtgcaa 
atggaaaata 
aacaccctgt 
gacggtggga 
gtagagaagg 
ggtgtagctg 
gcgccatctg 
catcgattga 
cacagtgccc 
atcatgcaag 



tgggcggcat 
gcgattatct 
tgggccagct 
ctgtttcact 
aacaggctct 
gcgatcgctg 
tacgtatagc 
gaatgttaat 
cacttagcct 
atgatccgaa 
ccaccagcca 
tttacggcgc 
gtgtcggagc 
ctggtggctg 



ggtgcaagtg 
tctatatctt 
aaacatgctt 
ggttatgcgg 
agcgttcgaa 
ccaggatata 
cgaaattgcc 
ccatattggc 
gggggtaact 
taactacctg 
gctatcaact 
taaggatgac 
cgcgcgagat 
gaccaatgta 



aataaccgga 
caggcgcgcg 
catcgtcggt 
cggatccgaa 
cgcactgatt 
cgtaatctgg 
aggatcaggg 
agaacgaaaa 
aaactggtcg 
ttttgccggg 
cgcgccctgg 
tctggtcaga 
atggcccgcg 
aatattgtca 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1022 



<210> 5 
<211> 2293 
<212> DNA 

<213> Artificial Sequence 



<400> 5 
gtctagtaat gtccaacacc 
atttatacct gaagcacttg 
ctctgagacc ctctggattt 
aaggtcaagt tccctcagca 
acaaatccat gtggctgttt 
cctagaagca ggtgtgtggc 
ggatgccatc tttgtcactt 
cccacccctc tcatgcccat 
aggtcctggt cctctttgac 
agggtgctgg ctcccaggcc 
cgacccaggt ggtgtcccct 
tggagcaaaa gctgatttct 
ccaatttact gaccgtacac 
aggttcgcaa gaacctgatg 
ggaaaatgct tctgtccgtt 
aatggtttcc cgcagaacct 
gtctggcagt aaaaactatc 
ccgggctgcc acgaccaagt 
aagaaaacgt tgatgccggt 
tcgaccaggt tcgttcactc 
catttctggg gattgcttat 
ttaaagatat ctcacgtact 
cgctggttag caccgcaggt 
agcgatggat ttccgtctct 
tcagaaaaaa tggtgttgcc 
aagggatttt tgaagcaact 
gatacctggc ctggtctgga 
ctggagtttc aataccggag 
tgaactatat ccgtaacctg 
gcgattagcc attaacgcgt 
gagaaaggat ttcaacatcg 
gtcgccagcc gacattgtca 
atccggtgcg tttcctgtca 
cgaggaagaa gcacggcgcg 
taaggttgga attgtcgagg 
cccgtgatat tgctgaagag 
gtatcgccgc tcccgattcg 
gaggggatcg gcaataaaaa 
tcgatccgtc gac 



tccctcagtc 
atggggcctc 
gtctgtcagt 
gcattctctg 
cacccacctg 
acttaacacc 
cttgactgtg 
atttggacat 
ttcataattc 
acagcccaca 
gctctgagcc 
gaggaggatc 
caaaatttgc 
gacatgttca 
tgccggtcgt 
gaagatgttc 
cagcaacatt 
gacagcaatg 
gaacgtgcaa 
atggaaaata 
aacaccctgt 
gacggtggga 
gtagagaagg 
ggtgtagctg 
gcgccatctg 
catcgattga 
cacagtgccc 
atcatgcaag 
gatagtgaaa 
aaatgattgc 
acggaaaata 
ctgtaaagct 
aaagtatgcg 
gttttgctaa 
ctgggtgtgg 
cttggcggcg 
cagcgcatcg 
gacagaataa 



caaacactgc 
aatgttttac 
gcctcactgg 
agcagtctga 
cctggccttg 
taagctgagt 
acacaagcaa 
ggtacaggtc 
ctaggggcca 
aaattccacc 
agctcccggc 
tgggaggacc 
ctgcattacc 
gggatcgcca 
gggcggcatg 
gcgattatct 
tgggccagct 
ctgtttcact 
aacaggctct 
gcgatcgctg 
tacgtatagc 
gaatgttaat 
cacttagcct 
atgatccgaa 
ccaccagcca 
tttacggcgc 
gtgtcggagc 
ctggtggctg 
caggggcaat 
tataattatt 
tgtagtgctg 
gagcgataga 
tagtgctgaa 
agtgatgtct 
cggaccgcta 
aatgggctga 
ccttctatcg 
aacgcacggg 



tctgcatcca 
tagagcccac 
ggcgttggat 
agatgtgtgc 
ggttatctat 
gactaactga 
ctcctgatgc 
ctcactggcc 
ctagtatcta 
tgctcacagg 
caagccagca 
caagaagaag 
ggtcgatgca 
ggcgttttct 
gtgcaagttg 
tctatatctt 
aaacatgctt 
ggttatgcgg 
agcgttcgaa 
ccaggatata 
cgaaattgcc 
ccatattggc 
gggggtaact 
taactacctg 
gctatcaact 
taaggatgac 
cgcgcgagat 
gaccaatgta 
ggtgcgcctg 
tgatatttat 
tctgtaagca 
atgcctgata 
catttcgcga 
gagtttggcg 
tcaggacata 
ccgcttcctc 
ccttcttgac 
tgttgggtcg 



tgtggctccc 
ccccctgcaa 
aatttcttaa 
tttcacagtt 
caggacctag 
acactcaagt 
caaagccctg 
atggtctgtg 
taagaggaag 
ttggctggct 
cccgggacca 
aggaaggtgt 
acgagtgatg 
gagcatacct 
aataaccgga 
caggcgcgcg 
catcgtcggt 
cggatccgaa 
cgcactgatt 
cgtaatctgg 
aggatcaggg 
agaacgaaaa 
aaactggtcg 
ttttgccggg 
cgcgccctgg 
tctggtcaga 
atggcccgcg 
aatattgtca 
ctggaagatg 
ggtgacatat 
ctaatattca 
ttgactcaat 
tgaatcccac 
aactcttggg 
gcgttggcta 
gtgctttacg 
gagttcttct 
tttgttcgga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2293 



<210> 6 
<211> 86 
<212> DNA 

<213> Artificial Sequence 
<400> 6 



cccggga'tcaYttcaccatg ggaataactt cgtatagcat acattatacg aagttatgga 
tccgccgcta tcaggacata gcgttg 



60 
86 



<210> 7 
<211> 4172 
<212> DNA 

<213> Artificial Sequence 



<400> 7 
gcacttttcg gggaaatgtg 
atatgtatcc gctcatgaga 
agagtatgag tattcaacat 
ttcctgtttt tgctcaccca 
gtgcacgagt gggttacatc 
gccccgaaga acgttttcca 
tatcccgtat tgacgccggg 
acttggttga gtactcacca 
aattatgcag tgctgccata 
cgatcggagg accgaaggag 
gccttgatcg ttgggaaccg 
cgatgcctgt agcaatggca 
tagcttcccg gcaacaatta 
tgcgctcggc ccttccggct 
ggtctcgcgg tatcattgca 
tctacacgac ggggagtcag 
gtgcctcact gattaagcat 
ttgatttaaa acttcatttt 
tcatgaccaa aatcccttaa 
agatcaaagg atcttcttga 
aaaaaccacc gctaccagcg 
cgaaggtaac tggcttcagc 
agttaggcca ccacttcaag 
tgttaccagt ggctgctgcc 
gatagttacc ggataaggcg 
gcttggagcg aacgacctac 
ccacgcttcc cgaagggaga 
gagagcgcac gagggagctt 
ttcgccacct ctgacttgag 
ggaaaaacgc cagcaacgcg 
acatgttctt tcctgcgtta 
gagctgatac cgctcgccgc 
cggaagagcg cccaatacgc 
gctggcacga caggtttccc 
gttagctcac tcattaggca 
gtggaattgt gagcggataa 
agctcgaaat taaccctcac 
gatcaattca ccatgggaat 
tcgagcagtg tggttttgca 
tccacccaat gtcgagcagt 
cctggaatgt ttccacccaa 
tcgaacacgc agatgcagtc 
acgcgtgtgg cctcgaacac 
agatggattg cacgcaggtt 
ggcacaacag acaatcggct 
cccggttctt tttgtcaaga 
agcgcggcta tcgtggctgg 



cgcggaaccc ctatttgttt 
caataaccct gataaatgct 
ttccgtgtcg cccttattcc 
gaaacgctgg tgaaagtaaa 
gaactggatc tcaacagcgg 
atgatgagca cttttaaagt 
caagagcaac tcggtcgccg 
gtcacagaaa agcatcttac 
accatgagtg ataacactgc 
ctaaccgctt ttttgcacaa 
gagctgaatg aagccatacc 
acaacgttgc gcaaactatt 
atagactgga tggaggcgga 
ggctggttta ttgctgataa 
gcactggggc cagatggtaa 
gcaactatgg atgaacgaaa 
tggtaactgt cagaccaagt 
taatttaaaa ggatctaggt 
cgtgagtttt cgttccactg 
gatccttttt ttctgcgcgt 
gtggtttgtt tgccggatca 
agagcgcaga taccaaatac 
aactctgtag caccgcctac 
agtggcgata agtcgtgtct 
cagcggtcgg gctgaacggg 
accgaactga gatacctaca 
aaggcggaca ggtatccggt 
ccagggggaa acgcctggta 
cgtcgatttt tgtgatgctc 
gcctttttac ggttcctggc 
tcccctgatt ctgtggataa 
agccgaacga ccgagcgcag 
aaaccgcctc tccccgcgcg 
gactggaaag cgggcagtga 
ccccaggctt tacactttat 
caatttcaca caggaaacag 
taaagggaac aaaagctggg 
aacttcgtat agcatacatt 
agaggaagca aaaagcctct 
gtggttttgc aagaggaagc 
tgtcgagcaa accccgccca 
ggggcggcgc ggtcccaggt 
cgagcgaccc tgcagccaat 
ctccggccgc ttgggtggag 
gctctgatgc cgccgtgttc 
ccgacctgtc cggtgccctg 
ccacgacggg cgttccttgc 



atttttctaa atacattcaa 
tcaataatat tgaaaaagga 
cttttttgcg gcattttgcc 
agatgctgaa gatcagttgg 
taagatcctt gagagttttc 
tctgctatgt ggcgcggtat 
catacactat tctcagaatg 
ggatggcatg acagtaagag 
ggccaactta cttctgacaa 
catgggggat catgtaactc 
aaacgacgag cgtgacacca 
aactggcgaa ctacttactc 
taaagttgca ggaccacttc 
atctggagcc ggtgagcgtg 
gccctcccgt atcgtagtta 
tagacagatc gctgagatag 
ttactcatat atactttaga 
gaagatcctt tttgataatc 
agcgtcagac cccgtagaaa 
aatctgctgc ttgcaaacaa 
agagctacca actctttttc 
tgtccttcta gtgtagccgt 
atacctcgct ctgctaatcc 
taccgggttg gactcaagac 
gggttcgtgc acacagccca 
gcgtgagcta tgagaaagcg 
aagcggcagg gtcggaacag 
tctttatagt cctgtcgggt 
gtcagggggg cggagcctat 
cttttgctgg ccttttgctc 
ccgtattacc gcctttgagt 
cgagtcagtg agcgaggaag 
ttggccgatt cattaatgca 
gcgcaacgca attaatgtga 
gcttccggct cgtatgttgt 
ctatgaccat gattacgcca 
tacgaattca gatctcccgg 
atacgaagtt atggatccgg 
ccacccaggc ctggaatgtt 
aaaaagcctc tccacccagg 
gcgtcttgtc attggcgaat 
ccacttcgca tattaaggtg 
atgggatcgg ccattgaaca 
aggctattcg gctatgactg 
cggctgtcag cgcaggggcg 
aatgaactgc aggacgaggc 
gcagctgtgc tcgacgttgt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 
660 
720 
780 
840 
900 
960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 



ca c t gaagc g g gaagggac t gg = £™ ™S SX 

atctcacott gctcct g cc g ^aaa g tatc «^atgg g « tc g a g c g agc 3000 

tacgcttgat ccggctacct gcccattcga " a «^ » | ag catca ggg 3060 

.egLetagg atggaagccg 9tcttgtog. t« gg at g at ^ fc 3120 

qctcgc g cca g ccgaactgt togccag g ct caa 99cgcgc g » gcC g C ttttc 3180 

9 cgtS4acc catggcgatg cctgcttgcc 9-£>^£ f a f c ^ g f ca Lgittggc 3240 

tggattcatc gactgtggoc cjgctgggtgt 99=99*"^ tac 99 tcgt gcttta 3300 

'tacccgtgat attgctgaag agcttggcgg «>£9»* 9-c«£'~ ^ 

cggtatcgcc gctcccgatt cgcagcgcat =9«ttctat 9 cgtttgtteg 3420 

ctgaggggat cggcaataaa »9^9»t ^ aa ^cac g gg g ^ atgg 3480 

gatagg g atc aattcaccat gggaataact ^tatagc tatagtgagt 3540 

atccactagt tctagagcgg ccgccaccgc JW^ "gggalaac cctggcgtta 3600 

cgtattacaa ttcactggcc gtcgttttac ^tcgtga 9ffl agcgaagag g 3660 

cLaacttaa tcgccttgca 9cacatcccc =^9 gacgc gccct 3720 

cccgcaccga tcgcccttcc caacagttgc ^cctgaa ^ acttg 3780 

gtagcggcgc attaagcgcg sogg**^ tggttaogcg 9^ acgttcgccg 38 40 

cca g c g ccct a g o g ccc g ct cctttcgett "ttcccttc agtgctttac 3900 

gc tttccccg tcaagctcta aatcgggggc tccctttaf 39S0 

Lcacctcga ccccaaaaaa ottgattagg S^f^" ctttaatagt ggactcttgt 4020 

gatagacggt ttttcgccet ttgaejttjg jgtc-egtt ctttaat^ gg^^ o 

tccaaactgg aacaacactc »"«^ Sgattta acaaaaattt aacgcgaatt — 
tgccgatttc ggoctattgg ttaaaaaatg a gct g at 
ttaacaaaat attaacgctt acaatttagg tg 
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<210> 8 
<211> 34 
<212> DNA 

<213> Artificial Sequence 

<400> 8 34 
ataacttccjt atagcataca ttatacgaag ttat 
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